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Abstract 



The enormous successes have been made by quantum algorithms dur- 
ing the last decade. In this paper, we combine the quantum game with 
^ the problem of data clustering, and then develop a quantum-game-based 

, clustering algorithm, in which data points in a dataset are considered 

■ as players who can make decisions and implement quantum strategies in 

' quantum games. After each round of a quantum game, each player's ex- 

O ■ pected payoff is calculated. Later, he uses an link-removing-and-rewiring 

(LRR) function to change his neighbors and adjust the strength of links 
connecting to them in order to maximize his payoff. Further, algorithms 
. are discussed and analyzed in two cases of strategies, two payoff ma- 

' trixes and two LRR functions. Consequently, the simulation results have 

^ demonstrated that data points in datasets are clustered reasonably and 

. efficiently, and the clustering algorithms have fast rates of convergence. 

' Moreover, the comparison with other algorithms also provides an indica- 

, tion of the effectiveness of the proposed approach. 

QQ ' Keywords: Unsupervised learning; Data clustering; Quantum compu- 

, tation; Quantum game 

^ '. 1 Introduction 

5t 1 Quantum computation is an extremely exciting and rapidly growing field. 

More recently, an increasing number of researchers with different backgrounds, 
ranging from physics, computer sciences and information theory to mathematics 
and philosophy, are involved in researching properties of quantum-based com- 
putation [T|. During the last decade, a series of significant breakthroughs had 
been made. One was that in 1994 Peter Shor surprised the world by proposing 
a polynomial-time quantum algorithm for integer factorization [5], while in the 
classical world the best-known classical factoring algorithm works in superpoly- 
nomial time. Three years later, in 1997, Lov Grover proved that a quantum 
computer could search an unsorted database in the square root of the time [S] ■ 
Meanwhile, Gilles Brassard et al. combined ideas from Grover's and Shor's 
quantum algorithms to propose a quantum counting algorithm 
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In recent years, many interests focus on the quantum game theory and con- 
siderable work has been done. For instance, D. A. Meyer [S] studied the Penny 
FUp game in the quantum world firstly. His result showed that if a player was 
allowed to implement quantum strategies, he would always defeat his opponent 
who played the classical strategies and increase his expected payoff as well. J. 
Eisert et al. [B] quantized the Prisoners' Dilemma and demonstrated that the 
dilemma could be escaped when both players resort to quantum strategies. A. 
P. Flitney et al. [7] generalized Eisert's result, the miracle move, i.e., the result 
of the game would move towards the quantum player's preferred result, while 
the other player used classical strategies. L. Marinatto et al. [8] investigated the 
Battle of the Sexes game in quantum domain. Their result showed that there 
existed a unique equilibrium in the game, when the entangled strategies were 
allowed. C. F. Lee et al. [H] reported that the quantum game is more efhcient 
than the classical game, and they found an upper bound for this efficiency. Be- 
sides, some experiments about the quantum games have also been implemented 
on different quantum computers [lOl [Til IE] ■ For more details about quantum 
games, see [TB] . 

Successes achieved by quantum algorithms make us guess that powerful 
quantum computers can figure out solutions faster and better than the best 
known classical counterparts for certain types of problems. Furthermore, it is 
more important that they offer a new way to find potentially dramatic algorith- 
mic speed-ups. Therefore, we may ask naturally: can we construct quantum 
versions of classical algorithms or present new quantum algorithms to solve 
the problems in pattern recognition faster and better on a quantum computer? 
Following this idea, some researchers have proposed their novel methods and 
demonstrated exciting results [HI [151 [El [13 [E] ■ 

In addition, data clustering is a main branch of Pattern Recognition, which 
is widely used in many fields such as pattern analysis, data mining, informa- 
tion retrieval and image segmentation. In these fields, however, there is usually 
little priori knowledge available about the data. In response to these restric- 
tions, clustering methodology come into being which is particularly suitable for 
the exploration of interrelationships among data points. Data clustering is the 
formal study of algorithms and methods for grouping or classifying unlabeled 
data points [H]. In other words, its task is to find the inherent structure of a 
given collection of unlabeled data points and group them into meaningful clus- 
ters |19l . In this paper, we attempt to combine the quantum game with the 
problem of data clustering in order to establish a novel clustering algorithm 
based on quantum games. In our algorithms, unlabeled data points in a dataset 
are regarded as players who can make decisions in quantum games. On a time- 
varying network formed by players, each player is permitted to use quantum 
strategies and plays a 2 x 2 entangled quantum game against every one of his 
neighbors respectively. Later, he applies a link-removing-and-rewiring (LRR) 
function to remove the links of neighbors with small payoffs and create new links 
to neighbors with higher payoffs at the same time. Furthermore, the strength 
of links between a player and his neighbors is different from one another, which 
is updated by the Grover iteration. During quantum games, the structure of 
network and the strength of links between players tend toward stability grad- 
ually. Finally, if each player only connects to the neighbor with the highest 
strength, the network will naturally divide into several separate parts, each of 
which corresponds to a cluster. 
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The remainder of this paper is organized as follows: Section 2 introduces 
some important concepts about the quantum computation and the quantum 
Prisoners' Dilemma briefly. In Section 3, the algorithms are established in two 
cases of strategies, payoff matrices and link-removing-and-rewiring (LRR) func- 
tions, and then they are elaborated and analyzed. In Section 4, the relationship 
between the number of nearest neighbors and the number of clusters is dis- 
cussed. Next, the effect of the cost in the SD-like payoff matrix is analyzed, 
and the relationship between the total payoffs and the rates of convergence of 
algorithms is explained. In Section 5, those datasets used in the simulations 
are introduced briefly, and then results of algorithms are demonstrated. The 
conclusion is given in Section 6. 



2 Quantum computation and quantum game 

2.1 Quantum computation 

The elementary unit of quantum computation is called the qubit, which is 
typically a microscopic system, such as an atom, a nuclear spin, or a polarized 
photon. In quantum computation, the Boolean states and 1 are represented by 
a prescribed pair of normalized and mutually orthogonal quantum states labeled 
as {|0), |1)} to form a 'computational basis' [5D]. Any pure state of the qubit 
can be written as a superposition state a|0) -I- /3|1) for some a and /3 satisfying 
|ap -I- = 1 20J. A collection of n qubits is called a quantum register of 
size 71, which spans a Hilbert space of 2" dimensions, so 2" mutually orthogonal 
quantum states can be available. 

Quantum state preparations, and any other manipulations on qubits, have 
to be performed by unitary operations. A quantum logic gate is a device which 
performs a fixed unitary operation on selected qubits in a fixed period of time, 
and a quantum circuit is a device consisting of quantum logic gates whose com- 
putational steps are synchronized in time [20| . The most common quantum gate 
is the Hadamard gate, which acts on a qubit in state |0) or |1) to produce 
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For more details, see [201 EI 



2.2 Quantum Prisoners' Dilemma 

The Prisoners' Dilemma (PD), a well-known example in the classical game 
theory, is an abstract of many phenomena in the real world and it has been 
wildly used in plenty of scientific fields. In this game, each of two players has 
two optional strategies, cooperation (C) and defection (D). Later, he chooses 
one strategy against the other's for maximizing his own payoff, so does the other 
side at the same time, but both sides do not know the opponent's strategy. As 
a result, each player receives a payoff which depends on his selected strategy, 
where the payoff matrix under different strategy profiles is described in Table [T] 
According to the classical game theory, the strategy profile (defection, defection) 
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Table 1: Payoff matrix for the Prisoners' Dilemma. 
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is the unique Nash Equilibrium [221 I23j . but unfortunately it is not Pareto 
optimal 24j. 

In the quantum game, however, thanks to the quantum strategies, the 
dilemma in the classical game can be escaped in a restricted strategic space (SI . 
The physical model of quantum Prisoners' Dilemma presented by Eisert ^ is 
shown in Figure [TJ 
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Figure 1: The block diagram of the system. 

If the possible outcomes of the classical strategies, C = and D — 1^ are 
assigned to two basis vectors {|C = 0), \D = 1)} in Hilbert space respectively, 
then at any time the state of the game may be represented by a vector in the 
space spanned by the basis {|00), |01), |10), |11)} [B]. Assume the initial state of 
the game is jV'o) — J |00), where J is an entangling operator which is known to 
both players. For a two-player game with two pure strategies, the general form 
of J may be written as (HI US] 
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where 7 € [0, 7r/2] is a measure of entanglement of a game. When 7 — 7r/2, 
there is a maximally entangled game, in which the entangling operator takes 
form 

1 



^(7) 
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(3) 



Next, each player chooses a unitary operator YiiY^) from the strategy space 
Si{S2) and operates it on the qubit that belongs to him, which makes the game 
in a state (li ® Y2)J\QQ). Specifically, the unitary operators C and D that 
correspond to the strategies, cooperation and defection, are given below [B] 



C 



1 
1 



D 



1 

-1 



(4) 



In the end, before a projective measurement in the basis {|0), |1)} is carried 
out, the final state is 

= J^(ri®r2)j|oo). (5) 

Thus, the player's expected payoff is written as 

z = i?|(V/|oo)p + 5|(^/|oi)p + r|(^/|io)p + P\{^}\n)?. (6) 

For more details, see [51 [?7]. 



4 



3 Algorithm 



In the section, we will combine the model of the quantum game with the 
problem of data clustering, and then establish clustering algorithms based on 
quantum games. Assume an unlabeled dataset X = {Xi,X2, ■ ■ ■ , Xjv}, which 
are distributed in a m-dimensional metric space. Each data point in the dataset 
is considered as a player in quantum games who can make decisions and always 
hope to maximize his payoff. In the metric space, there is a distance function 
d : X X X — > M, satisfying the closer the two players are, the smaller the 
output is. Based on the distance function, a k nearest neighbors (knn) network 
as a weighted and directed network, Go{X , Eo,d), may be created among data 
points by adding k edges directed toward its k nearest neighbors for each player. 

Definition 1 // there is a set X with N players, X = {Xi, X2, • • • , -^jv}, the 
weighted and directed knn network, Go{X,Eo,d), is created as below. 

f X= {X„i = l,2,--- ,N} 

Eo{i)^{eoiXi,Xj)\jGro{i)} (7) 
^o{i) = |i j = argmink(j^^d{Xi, Xh), Xh € x| 

Here, each player in the set X corresponds to a vertex in the network Go{X, Eq, d); 
Eq is a link set and a link in the network represents certain relationship between 
a pair of players; the distances denote the weights over links; the function, 
argmink{ ), is to find k nearest neighbors of a player which construct a neigh- 
bor set, To{i); the subscript '0' is the initial time step. 

It is worth noting that the strength of links between a player Xi and his 
k nearest neighbors represented by pt-i{Xi, X G rt-i{i){t > 1) is time- 
varying, whose initial values is calculated by 

,^ ^. r i/iroWl = i/fc, jeroW 

Po{Xi,Xj) = i . (8) 

t 0, otherwise 

where the symbol | • | denotes the cardinality of a set. 

After the initial connections are constructed among players (data points), 
on this weighted and directed knn network, a quantum game can be defined as 
following. 

Definition 2 A quantum game fl = {X,Gt, S, Zt} on a network Gt is a 4- 
tuple: X is a set of players; Gt represents the connections among players; S = 
{s{i),i = 1,2, - • • represents a set of players' strategies including the full 
range of quantum strategies; Zt — {zt(i),i = 1.2,- ■■ , A^} represents a set of 
players ' expected payoffs. Here, the variable t denotes the time step ( the number 
of iterations). In each round, players choose theirs strategies simultaneously, 
and each player can only observe its neighbors ' payoffs, but does not know the 
strategy profiles of all other players in X. 
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3.1 Cases of quantum strategies and payoff matrices 

At first, each player selects a strategy from his strategy set, and then plays 
a 2 X 2 entangled quantum game against one of his k neighbors respectively. In 
the classical 2x2 game, such as the Prisoners' Dilemma, usually there are only 
two pure strategies, cooperation and defection, but in the quantum game, one 
can design different unitary operators as strategies, i.e., the strategy set S may 
be identified with some subset of the group of 2 x 2 unitary matrices [5] . Here, 
for the purpose of clustering and simplifying computation, the strategy set of a 
player Xi is restricted in a set 5*1 = {H,I)} or S2 = {Ft-i{i,j),D}, and then 
two cases of strategy sets are described respectively. 

Case 1: 

In this case, a player and his opponent can apply strategies in Si = {H, D}. 
When a player Xi use the Hadamard matrix H as a strategy, his opponent 
(neighbor) Xj has two optional strategies {H, D}, but which strategy is chosen 
is dependent on the strength of the link between them, as is a rule of the 
clustering algorithm. If the strength pt-i{X j, Xi) equals to zero, i.e., there is 
no link directed from the player Xj to the player X^, then the player Xj will 
apply the strategy 'Defection' (D). Alternatively, if pt-i{X j, Xi) > 0, namely 
mutual connections between them, the player X j implements the strategy H. 
If the initial state of the game is \ipo) — J|00), by applying the model of the 
quantum game the final state of the game is 

r i(|00)-z|01)-z|10) + |ll)), HdH 
I ;^(^|00)-|01)), H<E>D 

Case 2: 

The strategy set S2 — {Ft-i{i, j), D} is adopted in the case. The strategy 



i^t-l(«,j) 



y/pt-l{Xi, Xj) y/l - pt-l{X,, Xj ) 
y^l - pt-l{Xi, Xj) -y/pt-l{Xi, Xj) 



is a general form of Hadamard matrix H whose elements are associated with the 
strength of links pt~i{Xi, Xj). When pt-i{Xi, Xj)—0.5, the strategy Ft^i{i,j) 
recovers the strategy H. Similarly, the neighbor Xj, when pt-i{Xj,Xi) = 0, 
applies the strategy 'Defection' {£)), while using the strategy Ft-i{i,j) when 
pt-i{X j, X i) > 0. If the initial state of the game is |V'o) = "^lOO), after their 
moves, the final state of the game is 



^/pTp?|00)- Vp2(1-Pi)|01) 



^^/T~^|00) - ^|01), Ft-i(g>D 

(10) 

According to the payoff matrix, the player's expected payoff can be computed 

by 

zt-iii) = Ejer,_iW zt-i{X„ Xj) 

= E,er. ,(,)i?|(V'/.,|00)p + ^|(V'/,,|01)p+T|(^;,,|10)p + P|(V./,,|ll)p. 

(11) 
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In practice, the payoff matrix takes PD-like or Snowdrift (SD)-like form, de- 
scribed in Table [2] and [31 In the PD-like payoff matrix, the following inequali- 



Table 2: PD-like payoff matrix. 

C = D = l 



C = R = 0.6ujt-i{X^,Xj) 5 = 0.01wt_i(X„Xj) 
D = l T = ut-i{X„Xj) P = Q.2ujt-i{X„Xj) 



Table 3: SD-like payoff matrix. 



C = D = l 



C = R = LOt-i{X,,X,)-^ S = iOt-iiX,,X,)-c 
D = l T ^LOt-i{X,,X,) P = Qmujt^i{X,,Xj) 




ties holds: T > R > P > S and 2i? > T 4- 5 [28] . To avoid a case that a player's 

expected payoff is zero, the variable S in Table [D takes a small value instead 
of zero in Table [TJ In addition, the Snowdrift game assumes that two drivers 
are blocked by a snowdrift, each of whom is in either side of the snowdrift. If 
they want to go back home, one of them or both must shovel a path through 
the snowdrift. So, there exists a cost c in the SD-like payoff matrix and the 
following inequality holds: T > R > S > P [HI, where c = (3 ■ LJt^i{Xi, Xj) 
and /3 is a proportional factor. Besides, the variable Wt-iiXi, X j) in two payoff 
matrices is calculated by the formulation below. 

iOt-i{X„ Xj) - Xj) X Degt-i{Xj)/d{X,, X,) (12) 

3.2 Design of LRR functions 

When all players' payoffs have been computed, each player will observe his 
neighbors' payoffs, and apply a link-removing- and-rewiring (LRR) function Li{-) 
to change his links. A LRR function is defined as below. 

Definition 3 The LRR function Li{-) is a function of payoffs, whose output is 
a set with k elements, namely an updated neighbor set Tt{i) of a player Xi. It 
is given as below. 



Tt{i) ^ Li(zt-i{i)) ^ argmaxk ({zt^i{j),jeTt-i{i)[jTt-i{i)} 

ierf_i(i)UTt-i(j) ^ 

^t-iW - e ri_i(*)UT*-i(*)},T,_i(z) = u,^r+_,w rt-i(j) 

rti« = > OtMi),j e ri_i(z)},r- i(z) = rt^,{{)\r+_,{t) 



where 9t-i(i) is a payoff threshold, Tt-i{i) is called an extended neighbor set, 
and the function argmaxk{-) is to find k neighbors with the first k largest payoffs 
in the setTt^i(i)[JTt-i{i). 

Here, two LRR functions Ll{-) and Lf{-) are designed. The function Ll{-) 
always observes an extended neighbor set formed by half neighbors of a data 



(13) 
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point Xi, a = [ 0.5 x |rt_i(i)| ], where the symbol [•] is to take an integer 
part of a number satisfying the integer part is no larger than the number. Next, 
the payoff threshold Ol-i{i) is set by ^t_i(j) = find"{{zt-i{i),j G Tt-iii)}), 
where the function find°'{-) is to find the a-th largest payoff in a set that 
contains all neighbors' payoffs of the data point. When the LRR function L}{-) 
is applied, the links connecting to the neighbors with small payoffs are removed 
and meanwhile new links are created between the data point and found players 
with higher payoffs. Hence, according to Eq. p^ . the new neighbor set is Tt{i) = 

Unlike the LRR function Lj{-), the LRR function Lf{-) adjusts the number 
of neighbors dynamically instead of the constant number of neighbors in Ll{-). 
Therefore, the payoff threshold 9^_-y{i) takes the average of neighbors' payoffs, 
^t-i(*) = X]jert_i('i) ^t-i{i)/\^t-i{i)\- Next, the set r^^(z) is formed according 
toEq.([ni), r+_i(z) = {j\zt-i{j) > e rt_i(i)}, and then the new neigh- 

bor set is achieved by means of the LRR function Lf{-), Tt{i) = Lf(it_i(i)). 
In the case, when the payoffs of all neighbors are equal to the payoff thresh- 
old 9^_i{i), the output of the LRR function is Tt{i) = Tt-i{i)- This may be 
viewed as self-protective behavior for avoiding a payoff loss due to no enough 
information acquired. 

The LRR function Li{-) expands the view of a player Xi, i.e., it makes him 
observe payoffs of players in the extended neighbor set, which provides a chance 
to find players with higher payoffs around him. If no players with higher payoffs 
are found in the extended neighbor set, namely mm({zt_i(j), j S Tt-i{i)}) > 
max{{zt-i{h), h g Tt_i(z)}), then the output of the LRR function is Tt{i) — 
Tt-i(i). Otherwise, players with small payoffs will be removed together with the 
corresponding links from the neighbor set and link set, and replaced by some 
found players with higher payoff. This process is repeated till the payoffs of 
unlinked players in the extended neighbor set are no larger than those of linked 
neighbors. Since the links among players, namely the link set Eq in the network 
Go{X , Eo,d), are changed by the LRR function, the network Gt{X , Et,d) has 
begun to evolve over time, when t > 1. 



Gt{X,Eud) 



' X(t) = {x,(i),z = l,2,... ,7V} 
Tt{i) = Li{zt-i{i)) 

^ Et{i)^[et{X,,X.j)\jeTt{i)] 



(14) 



3.3 Strength of links updating 



After the LRR function is applied, the strength of links of players needs 
to be formed and adjusted. The new strength of links of a player Xi € X is 
formed by means of the below formulation. 



pt{Xi,Xj 



)\{rt_i(i)nrt(i)} 



pt-i[X,.Xh) 



rtW\{rt_iWnrt(i)} 
. Pt-i{Xi,Xj) 



3 eT^{^)\{Tt-l{^)(^Tt{l)} 
otherwise 

(15) 
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Then, the player adjusts the strength of hnks as follows. First, he finds a 
neighbor X,„ with maximal payoff in his neighbor set, 

m = argmax({zt-i{j)J G Tt{i)}) (16) 

Next, the strength of link pt{Xi, Xj),j G Tt{i) is taken its square root and the 
player X^'s strength of the link becomes negative, 

yptix,,x,),jeTt{z)} ^^^^ 

y^Pt(Xj, Xm) = -y^ptiXi, Xm), m G Tt{i) 

Further, let Avet{i) — {J2j(£rt{i) VPti^i^ ^ m))/\^t{i)\, thus, the updated strength 
of link is 

Pt{X„X,)^ (2 X Avetii) - ^pt{X,,X,)y,j e Tt{i). (18) 

The above-mentioned method is a variant of the Grover iteration G in the 
quantum search algorithm ^3j, a well-known algorithm in quantum computation, 
which is a way to adjust the probability amplitude of each term in a superposi- 
tion state. By adjustment, the probability amplitude of the wanted is increased, 
while the others are reduced. This whole process may be regarded as the in- 
version about average operation 'SI. For our case, each strength of link is taken 
its square root first, and then the average Avet{i) of square roots is computed. 
Finally, all values are inverted about the average. There are three main reasons 
that we select the modified Grover iteration to update the strength of links: (a) 
the sum of updated strength of links retains one, J2j^rt{i) Pti^i^ ^ j) = li (b) 
certain strength of links between a player and his neighbors is much larger than 
the others, pt{Xi, X j) ^ pt{Xi, Xh),h S Tt{i)\j, after the strength of links 
is updated, and (c) it helps players' payoffs to follow a power law distribution, 
in which only a few players' payoffs are far larger than others' in the end of 
iterations. 

Besides, the process that all players adjust their strength is order irrelevant, 
because the strength of links of all players is updated synchronously, and the 
network is a directed network, so the strength cannot be overridden by other 
players. When the strength of links of each player has been updated, an iteration 
is completed. In conclusion, when t > 1 , the structure of network representing 
connections among players begins to evolve over time. 

4 Discussions 

In the section, firstly, the relationship between the number of nearest neigh- 
bors and the number of clusters is discussed, and then how the cost in the 
SD-likc payoff matrix influences the results is analyzed, which provides a way to 
choose the proportional factor. Finally, the total payoffs based on two different 
payoff matrices are compared and the relationship between the total payoffs and 
the rates of convergence of algorithms is explained. 
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4.1 Number of nearest neighbors vs. number of clusters 



The number k of nearest neighbors represents the number of neighbors to 
which a data point (player) Xi S X connects. For a dataset, the number 
k of nearest neighbors determines the number of clusters in part. Generally 
speaking, the number of clusters decreases inversely with the number k of nearest 
neighbors. For example, when the number k of nearest neighbors is small, it is 
indicated that the player Xi connects to only a few neighbors. At this time only 
those not-too-distance neighbors can be observed by the LRR function, which 
means that the elements in the union of the extended neighbor set Tt_i(z) and 
the neighbor set Tt-i{i) are few. Therefore, when the evolution of the network 
formed by players is ended, many small clusters are established among data 
points. On the other hand, a big number k of nearest neighbors provides more 
neighbors for each player, as specifies that the cardinality of the union is larger 
than that when a small k is taken. This means that more neighbors can be 
observed and explored by the LRR function, so that big clusters containing 
more data points are formed. 

For a dataset, the clustering results at the different number k of nearest 
neighbors have been illustrated in Fig. [21 in which each data point only connects 
to one of its neighbors who has the largest strength of link, and clusters are 
represented by different signs. As is shown in Fig. [21 it can be found that 
only a few data points receive considerable links, whereas most of data points 
have only one link. This implies that when the structure of network tends to 
stability, the network, if only the links with the largest strength are remained, 
is characterized by the scale- free network [21], i.e., winner takes all. Besides, in 
Fig. [2Ia), six clusters are obtained by the clustering algorithm, when k = 9. As 
the number k of nearest neighbors rises, four clusters are obtained when k — 12, 
three clusters when k = 16. So, if the exact number of clusters is not known in 
advance, different numbers of clusters may be achieved by adjusting the number 
of nearest neighbors in practice. 






(a) fc = 9 (b) k = 12 (c) k = 16 

Figure 2: The number of nearest neighbors vs. number of clusters, (a) six 
clusters are obtained, when the number of nearest neighbors is fc = 9. (b) four 
clusters, when k — 12. (c) three clusters, when fc = 16 



4.2 Effect of the cost c in the SD-Iike payoff matrix 

In the Snowdrift game, if the cost is too high, the SD-like payoff matrix 
recovers the PD-like payoff matrix [21]. Therefore, the proportional factor /3 
is restricted in an interval (0,0.5]. However, different costs will bring about 
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the changes of the payoff matrix, which means that different clustering results 
will be produced even in the same algorithm. Figure [3] illustrates how the 
clustering results change at the different number k of nearest neighbors when 
the proportional factor /3 takes different values, in which the clustering results 
are represented by clustering accuracies. The definition of clustering accuracy 
is given below. 

Definition 4 clusteri is the label which is assigned to a data point Xi in a 
dataset by the algorithm, and labeli is the actual label of the data point Xi in 
the dataset. So the clustering accuracy is WU^ : 

\{raap{clusteri) ^laheli\ 

accuracy = — — — - 

{1 if map{clusteri) — labeli 
(19) 
otherwise 

where the mapping function map{-) maps the label got by the algorithm to the 
actual label. 

Clustering accuracy is an important evaluation criterion for clustering algo- 
rithms, which reflects the level of matches between the actual labels in a dataset 
and the labels assigned by a clustering algorithm, so that the goal of a clustering 
algorithm is to obtain higher clustering accuracies. 

As is shown in Figure [31 it can be seen that similar results are obtained by 
the algorithm at different costs, and the best results are produced when k = 7 
and k = 8, but from the Figure [31^b), the clustering result with the largest mean 
and the least variance is yielded when /3 — 0.3. Also, as mentioned above, the 
high cost leads to the recovery of the SD-like payoff matrix, so the proportional 
factor (3 — 0.2 or /3 — 0.3 is recommended. 



(a) (b) 

Figure 3: The effect of the cost for the clustering results, (a) the results of 
the algorithm using the SD-like payoff matrix at different number k of nearest 
neighbors, (b) the mean and variance corresponding to each curve in (a). 



4.3 Total payoffs and the rate of convergence 

In this subsection, at first, the total payoffs of algorithms using the PD- or 
SD-like payoff matrix are compared respectively, and then the differences be- 
tween total payoffs are explained. Later, the rates of convergence of algorithms 
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are discussed when two LRR functions are applied respectively, and further the 
impact for the rates of convergence is analyzed in two payoff matrices. 

If all other conditions are fixed, an algorithm will form two versions due to 
using PD- or SD-like payoff matrix, and naturally this will bring different results. 
As compared with the PD-like payoff matrix, the payoff P and S in the SD-like 
payoff matrix have a reverse order in the payoff inequality. In all algorithms, the 
relationship between the total payoffs and the number of iterations is drawn in 
Figure m From Figure HI it can be found that the total payoffs in the algorithms 
with SD-like payoff matrix are larger than that of the algorithms with PD-like 
payoff matrix no matter which LRR function is selected. 




(a) (b) 



Figure 4: The total payoffs vs. the rates of convergence, (a) total payoffs of 
algorithms in the case of LRR function Lj(-), (b) total payoffs of algorithms in 
the case of LRR function Lf{-) 

Remark 1 The algorithms are named as follows. For example, the name of an 
algorithm, QGCIPDLI, denotes that the Case 1, PD-like payoff matrix and the 
LRR function L\[-) are employed in this algorithm. 



According to two payoff matrices, using Eg. pTj) . each player's expected pay- 
off can be calculated in two cases of strategies respectively as below. 
Case 1: 

f i(0.6w + 0.01w + w-h0.2w) = 0.4525W 
¥T):zAXi,XA = { f (20) 
^ " \ i(0.6tJ + 0.01c^) = 0.305c^ ^ ' 



SD : zt{X,,X,) ^ <^ f 2 J (21) 

' ^ i(c.-§+c.-c) = i(2-f )c. 



Case 2: 



Pi/O20.6a) + p2{l — pi)0.01ci; 
¥D : zt{X,,Xj) ^ { +Pi(l-P2)t^+(1-Pi)(l-P2)0.2w (22) 
(l-pi)0.6w + pi0.01cj 

PiP2(w - f ) + P2(l - Pi)(w - c) 
SD:zt{X,,Xj) = { +Pi(l-P2)w + (1-Pi)(l-P2)0.01w (23) 
(l-pi)(w- §) + /3i(w-c) 
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Comparing the expected payoffs in two cases, it can be observed that when 
the SD-hke payoff matrix is used, the expected payoff is larger than that of using 
PD-hke payoff matrix regardless of cases of strategies. Therefore, this explains 
why differences between total payoffs are produced. 

Besides, Figure |4] not only describes the changes of total payoffs of algo- 
rithms, but also reflects the rates of convergence of algorithms. When the total 
payoffs remain constant or fluctuate slightly, this means that the algorithms 
have converged. At this time, players do not frequently apply the LRR function 
to change his neighbors but reach a stable state. As mentioned in the section 
3.2, the LRR function Lf{-) only can observe an extended neighbor set formed 
by those larger-than-average neighbors in contrast to an extended neighbor set 
built by half neighbors in the LRR function i^(-). Generally speaking, for 
the same k, the median of payoffs is smaller than or equal to the mean, i.e., 
9l_i < 9f_i, which means that the exploring area of the LRR function L^(-) 
is larger than that of the LRR function (•)■ So, as a whole, the algorithms 
with the LRR function Lf{-) are slightly faster than that with the LRR function 
L^(-), i.e., the number of iterations that the former type of algorithms needs is 
a little less. 

Furthermore, in the algorithms, a phenomenon that the strategies H and 
Ft-i are more likely used by players in a high density area while the strategy 
D is used by those in a low density area is always observed. This is because 
usually they are mutually neighbors in the high density area on the weighted 
and directed knn network, but in the low density area this case is reverse. As a 
result, the differences of payoffs between the high density and low density areas 
are enlarged rapidly for the expected payoff in the strategy profile {H,H) or 
{Ft-i^Ft-i) is higher than that in other strategy profile, and this also cause the 
distribution of players' payoffs follows a power-law distribution, which is why 
algorithms converge fast. 

5 Simulations 

To evaluate these clustering algorithms, six datasets are selected from UCI 
repository 31J, which are Soybean, Iris, Wine, Sonar, Ionosphere and Breast 
cancer Wisconsin datasets, and complete the simulations on them. In this sec- 
tion, firstly these datasets are briefly introduced, and then the simulation results 
are demonstrated. 

The original data points in above datasets all are scattered in high dimen- 
sional spaces spanned by their features which are the individual measurable 
heuristic properties of the phenomena being observed, where the description of 
all datasets is summarized in Table |H As for Breast dataset, some lost features 
are replaced by random numbers, and the Wine dataset is standardized. Finally, 
the algorithms are coded in Matlab 6.5. 

Throughout all simulations, data points in a dataset are viewed as players 
in quantum games whose initial positions are taken from the dataset. Next, the 
initial network representing relations among data points are created according 
to Def.l, after a distance function is selected, which only needs to satisfy that 
the more similar data points are, the smaller the output of the function is. In 
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Table 4: Description of datasets. 



Dataset 


Instances 


Features 


classes 


Soybean 


47 


21 


4 


Iris 


150 


4 


3 


Wine 


178 


13 


3 


Sonar 


208 


60 


2 


Ionosphere 


351 


32 


2 


Breast 


699 


9 


2 



the simulations, the distance function is employed as following 

=exp(||X, -X,||/2a2),z,j = 1,2,... ,iV (24) 

where the symbol || • || represents L2-norm. The advantage of this function is 
that it not only satisfies our requirements, but also overcomes the drawbacks of 
Euclidean distance. For instance, when two points are very close, the output of 
Euclidean distance function approaches zero, which may make the computation 
of payoff fail due to the payoff approaching infinite. Nevertheless, when Eq. pi]) 
is selected as the distance function, it is more convenient to compute the players' 
payoffs, since its minimum is one and the reciprocals of its output are between 
zero and one, l/d{Xi, X j) e [0, 1]. 

In addition, the distance between a player Xi and itself, d{Xi, Xi), is one 
according to the defined distance function, which means that initially he is one 
of his k nearest neighbors. So there is an edge between the player Xi and 
himself, namely a self-loop. Additionally, the parameter tr in Eq. ([24|) takes one. 
As is analyzed in the section 4.2, the cost c in SD-like payoff matrix is set by 
c — Q.2u!t{Xi, Xj) in the related clustering algorithms. 

The clustering algorithms are applied on the six datasets respectively. Be- 
cause two cases of strategies are designed and the different payoff matrices and 
LRR functions are employed, the algorithms are run on every dataset at the 
different number k of nearest neighbors. As is analyzed in section 4.1, for a 
dataset the number k of clusters decreases inversely with the number of nearest 
neighbors. When a small k is selected, it is possible that the number of clusters 
is larger than the preset number of the dataset, after the algorithm is ended. So 
a merging-subroutine is called to merge unwanted clusters, which works in this 
way. At first, the cluster with the fewest data points is identified, and then it is 
merged to the cluster whose distance between their centroids is smallest. This 
subroutine is repeated till the number of clusters is equal to the preset number. 

The clustering results obtained by these algorithms are compared in Fig. [3 
in which each point represents a clustering accuracy. As is shown in Fig. El the 
similar results are obtained by these algorithms at different number of nearest 
neighbors, but almost all the best results are obtained by the algorithms using 
the LRR function L}{-). 

Further, we show a simple comparison with three other algorithms: Kmeans |321 
155] , PCA-Kmeans [33] , LDA-Km 33J . The Kmeans algorithm is a popular clus- 
tering algorithm because it is easy to implement. It begins with k randomly- 
chosen cluster centers, and then assigns each data point in a dataset to the 
closest cluster center. Next, the cluster centers are recomputed according to the 
current cluster memberships. This process will be repeated till a convergence 
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(a) Soybean dataset (b) Iris dataset (c) Wine dataset 




(d) Sonar dataset (e) Ionosphere dataset (f) breast dataset 



Figure 5: Comparison of clustering accuracies at different k in all proposed 
algorithms. 

criterion is met. However, its major problem is sensitive to the selection of the 
initial partition [19]. The PCA-Kmeans algorithm consists of two steps: (1) Re- 
duce the data dimension by principal component analysis (PCA); (2) Clustering 
data points by the Kmeans algorithm. Ding et al combine linear discriminant 
analysis (LDA) with the Kmeans algorithm, and construct a new clustering 
algorithm called 'LDA-Km'. In the LDA-Km algorithm, Kmeans is used to 
generate class labels, while LDA is used to do subspace selection. Finally, in 
the subspace selection process, data points are clustered. 

Our algorithms are established on the model of quantum games, so data 
points in datasets are considered as players. This idea is totally different from 
traditional clustering algorithms, such as Kmeans and its variants, because in 
traditional clustering algorithms data points for clustering are fixed, and various 
functions or methods are designed to find cluster centers or separating hyper- 
planes, whereas in the quantum-game-based clustering algorithms data points 
(players) themselves choose their clusters, which leads clusters are formed au- 
tomatically during quantum games. In conclusion, traditional clustering algo- 
rithms do it from outside, while ours are from inside. Furthermore, our algo- 
rithms do not need to choose cluster centers at beginning, so there does not exist 
the problem that is "sensitive to the selection of the initial partition". Besides, 
distances between data points are commonly used as the measurement of their 
similarities in the Kmeans algorithm and its variants. However, when similar- 
ities are measured in our algorithms, not only distances between data points 
but also their degrees and the strength of links are integrated, which provides 
more information for the measurement of similarities. Later, according to pay- 
offs in quantum games, players apply a LRR function to change the structure 
of the network, which makes the clusters emerge. As a result, better clustering 
accuracies are obtained by our algorithms. 

Table E] displays the clustering accuracies of our algorithms and the other 
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three algorithms using the datasets in Table U) It can be observed that on 
almost all datasets the quantum game based clustering algorithms have the 
best clustering accuracies. Only on the Iris dataset, the LDA-Km algorithm is 
better than ours, but our result is close to it as well. The Iris dataset contains 
three clusters, one of which is far from the other two, so data points in it 
can be clustered easily and correctly. The other clusters in the Iris dataset, 
however, are mixed partly in their boundaries, which brings a difficulty for 
clustering. These mixed boundary points may be assigned to different clusters 
by algorithms, which is why the clustering accuracies of algorithms are various. 
It is more difficult to cluster data points in the Sonar and Ionosphere dataset, 
because in these two datasets plenty of data points belonging to different clusters 
are mixed together. Therefore, the clustering accuracies of algorithms in them 
are not high, and it is rather hard to improve the clustering accuracies even 
by several percents. From the fifth and sixth columns in Table [5l it can be 
seen that the quantum game based clustering algorithms are better than other 
algorithms, which indicates the effectiveness of our algorithms. 



Table 5: Comparison of clustering accuracies of algorithms. 



Algorithm 


Soybean 


Iris 


Wine 


Sonar 


Ionosphere 


Breast 


QGCIPDLI 


97.9% 


92.0% 


94.9% 


54.8% 


75.5% 


95.4% 


QGCISDLI 


95.6% 


90.7% 


96.6% 


56.3% 


74.1% 


95.4% 


QGC2PDL1 


95.6% 


91.3% 


96.6% 


55.8% 


73.8% 


95.4% 


QGC2SDL1 


89.4% 


91.3% 


96.6% 


55.8% 


75.5% 


95.3% 


QGC1PDL2 


95.8% 


96.7% 


95.5% 


54.3% 


74.1% 


95.0% 


QGC1SDL2 


89.4% 


90.7% 


95.5% 


54.3% 


74.6% 


95.1% 


QGC2PDL2 


89.4% 


90.0% 


95.5% 


55.3% 


74.6% 


95.3% 


QGC2SDL2 


89.4% 


91.3% 


94.9% 


55.8% 


74.9% 


95.1% 


Kmeans [31] 


68.1% 


89.3% 


70.2% 


47.2% 


71.0% 




PCA-Kmeans [33] 


72.3% 


88.7% 


70.2% 


45.3% 


71.0% 




LDA-Km m\ 


76.6% 


98.0% 


82.6% 


51.0% 


71.2% 





6 Conclusions 

The enormous successes gained by the quantum algorithms make us realize 
it is possible that the quantum algorithms can obtain solutions faster and bet- 
ter than those classical algorithms for some problems. Therefore, we combine 
the quantum game with the problem of data clustering, and establish clustering 
algorithms based on quantum games. In the algorithms, data points for clus- 
tering are regarded as players who can make decisions in quantum games. On 
a weighted and directed knn network that represents relations among players, 
each player uses quantum strategies against every one of his neighbors in a 2 x 2 
entangled quantum game respectively. We design two cases of strategies: (i) one 
plays the strategy H, the other plays the strategy 77 or Z), (ii) one plays the 
strategy Ft_i, the other plays the strategy Ft_i or D according to the strength 
of links, in each of which players' expected payoffs are calculated based on the 
PD- and SD-like payoff matrices respectively. According to neighbors' payoffs 
in one's neighbor set, each player applies a LRR function {Ll{-) or Lf{-)) to 
change his neighbors, i.e., the links connecting to neighbors with small payoffs 
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are removed and then new links are created to those with higher payoffs. Later, 

the Grovcr iteration G is employed to update the strength of links between 
him and his neighbors. In the process of playing quantum game, the structure 
of the network formed by players tends to stability gradually. In other words, 
each player always connects to one of his neighbors with the largest strength or 
jumps among some neighbors with the largest strength in a constant period. At 
this time, if only the links with the largest strength are left but all other links 
are removed among players, the network naturally divides into several separate 
parts, each of which corresponds to a cluster. 

Additionally, in simulations, it can be found that the total expected payoffs of 
algorithms using SD-like payoff matrix are higher than that of algorithms using 
PD-like payoff matrix. Later, the reason is explained. Further, we observe that 
the rates of convergence of the algorithms employing the LRR function if (•) 
arc faster slightly than that of the algorithms employing the LRR function 
because more areas are explored by the LRR function L\{-), but this 
brings better clustering results. In the case when the exact number of clusters 
is unknown in advance, one can adjust the number k of nearest neighbors to 
control the number of clusters, where the number of clusters decreases inversely 
with the number k of nearest neighbors. Finally, the clustering algorithms are 
evaluated on six real datasets, and simulation results have demonstrated that 
data points in a dataset are clustered reasonably and efficiently. 
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